Word Sense Disambiguation Techniques for Indian and other Asian Languages: A Survey
نویسندگان
چکیده
Natural Languages used by people for establishing proper communication consist of many words having multiple meanings known as polysemous but implies a single sense depending on the context. Word sense disambiguation is a method of determining the appropriate sense of a polysemous word in the context. WSD is almost finished for English. It is a challenging task for Indian languages since these are morphologically rich in nature and development of various resources like machine readable dictionaries, WordNet etc. are in progress. We have discussed the unsupervised Graph based WSD for English. Then, we have discussed the various efforts accomplished by several researchers to develop WSD systems for Indian languages like Hindi, Kannada, Malayalam, and Assamese. Finally, we have discussed about WSD for other Asian languages like Nepali, Arabic and Myanmar.
منابع مشابه
Different Techniques Implemented in Gurumukhi Word Sense Disambiguation
One of the most important issues in the field of Natural Language Engineering is Word Sense Disambiguation (WSD).Gurumukhi or more commonly known as Punjabi, is world’s 12th most widely spoken language and this language is morphologically rich. But surprisingly, there are relatively less efforts in the field of computerization and development of lexical resources of this language. It is therefo...
متن کاملA Survey on Parts of Speech Tagging for Indian Languages
Part of speech (POS) tagging is basically the process of automatically assigning its lexical category to each word according to its context and definition. Each word of sentence is marked in croups as corresponding to a particular part of speech like noun, verb, adjective and adverb. POS serves as a first step in natural language process applications like information extraction, parsing, and wo...
متن کاملInfluence of Morphology in Word Sense Disambiguation for Tamil
Many Word Sense Disambiguation (WSD) algorithms do not take into account the morphological variations in the language. However, as Indian languages are highly inflected languages, we investigate whether morphology must be taken into account for WSD for Indian languages, as they are very rich in morphology. This paper analyses the influence of morphology in WSD for Tamil. We believe our results ...
متن کاملWord sense disambiguation: a survey
In this paper, we made a survey on Word Sense Disambiguation (WSD). Near about in all major languages around the world, research in WSD has been conducted upto different extents. In this paper, we have gone through a survey regarding the different approaches adopted in different research works, the State of the Art in the performance in this domain, recent works in different Indian languages an...
متن کاملParts Of Speech Tagging for Indian Languages: A Literature Survey
Part of speech (POS) tagging is the process of assigning the part of speech tag or other lexical class marker to each and every word in a sentence. In many Natural Language Processing applications such as word sense disambiguation, information retrieval, information processing, parsing, question answering, and machine translation, POS tagging is considered as the one of the basic necessary tool...
متن کامل